Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 40
Filtrar
1.
Biology (Basel) ; 13(4)2024 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-38666884

RESUMO

Obesity is a socially significant disease that is characterized by a disproportionate accumulation of fat. It is also associated with chronic inflammation, cancer, diabetes, and other comorbidities. Investigating biomarkers and pathological processes linked to obesity is especially vital for young individuals, given their increased potential for lifestyle modifications. By comparing the genetic, proteomic, and metabolomic profiles of individuals categorized as underweight, normal, overweight, and obese, we aimed to determine which omics layer most accurately reflects the phenotypic changes in an organism that result from obesity. We profiled blood plasma samples by employing three omics methodologies. The untargeted GC×GC-MS metabolomics approach identified 313 metabolites. To augment the metabolomic dataset, we integrated a label-free HPLC-MS/MS proteomics method, leading to the identification of 708 proteins. The genomic layer encompassed the genotyping of 647,250 SNPs. Utilizing omics data, we trained sparse Partial Least Squares models to predict body mass index. Molecular features exhibiting frequently non-zero coefficients were selected as potential biomarkers, and we further explored enriched biological pathways. Proteomics was the most effective in single-omics analyses, with a median absolute error (MAE) of 5.44 ± 0.31 kg/m2, incorporating an average of 24 proteins per model. Metabolomics showed slightly lower performance (MAE = 6.06 ± 0.33 kg/m2), followed by genomics (MAE = 6.20 ± 0.34 kg/m2). As expected, multiomic models demonstrated better accuracy, particularly the combination of proteomics and metabolomics (MAE = 4.77 ± 0.33 kg/m2), while including genomics data did not enhance the results. This manuscript is the first multiomics study of obesity in a gender-balanced cohort of young adults profiled by genomic, proteomic, and metabolomic methods. The comprehensive approach provides novel insights into the molecular mechanisms of obesity, opening avenues for more targeted interventions.

2.
Biology (Basel) ; 12(12)2023 Dec 06.
Artigo em Inglês | MEDLINE | ID: mdl-38132320

RESUMO

The long-read RNA sequencing developed by Oxford Nanopore Technologies provides a direct quantification of transcript isoforms, thereby making it possible to present alternative splicing (AS) profiles as arrays of single splice variants with different abundances. Additionally, AS profiles can be presented as arrays of genes characterized by the degree of alternative splicing (the DAS-the number of detected splice variants per gene). Here, we successfully utilized the DAS to reveal biological pathways influenced by the alterations in AS in human liver tissue and the hepatocyte-derived malignant cell lines HepG2 and Huh7, thus employing the mathematical algorithm of gene set enrichment analysis. Furthermore, analysis of the AS profiles as abundances of single splice variants by using the graded tissue specificity index τ provided the selection of the groups of genes expressing particular splice variants specifically in liver tissue, HepG2 cells, and Huh7 cells. The majority of these splice variants were translated into proteins products and appeal to be in focus regarding further insights into the mechanisms underlying cell malignization. The used metrics are intrinsically suitable for transcriptome-wide AS profiling using long-read sequencing.

3.
Int J Mol Sci ; 24(21)2023 Oct 24.
Artigo em Inglês | MEDLINE | ID: mdl-37958484

RESUMO

The long-read RNA sequencing developed by Oxford Nanopore Technology provides a direct quantification of transcript isoforms. That makes the number of transcript isoforms per gene an intrinsically suitable metric for alternative splicing (AS) profiling in the application to this particular type of RNA sequencing. By using this simple metric and recruiting principal component analysis (PCA) as a tool to visualize the high-dimensional transcriptomic data, we were able to group biospecimens of normal human liver tissue and hepatocyte-derived malignant HepG2 and Huh7 cells into clear clusters in a 2D space. For the transcriptome-wide analysis, the clustering was observed regardless whether all genes were included in analysis or only those expressed in all biospecimens tested. However, in the application to a particular set of genes known as pharmacogenes, which are involved in drug metabolism, the clustering worsened dramatically in the latter case. Based on PCA data, the subsets of genes most contributing to biospecimens' grouping into clusters were selected and subjected to gene ontology analysis that allowed us to determine the top 20 biological processes among which translation and processes related to its regulation dominate. The suggested metrics can be a useful addition to the existing metrics for describing AS profiles, especially in application to transcriptome studies with long-read sequencing.


Assuntos
Processamento Alternativo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Análise de Componente Principal , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Perfilação da Expressão Gênica/métodos , Transcriptoma , Análise de Sequência de RNA/métodos , Fígado , Isoformas de Proteínas/genética , Hepatócitos , Linhagem Celular
4.
Genes (Basel) ; 14(11)2023 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-38003008

RESUMO

Transcriptomics methods (RNA-Seq, PCR) today are more routine and reproducible than proteomics methods, i.e., both mass spectrometry and immunochemical analysis. For this reason, most scientific studies are limited to assessing the level of mRNA content. At the same time, protein content (and its post-translational status) largely determines the cell's state and behavior. Such a forced extrapolation of conclusions from the transcriptome to the proteome often seems unjustified. The ratios of "transcript-protein" pairs can vary by several orders of magnitude for different genes. As a rule, the correlation coefficient between transcriptome-proteome levels for different tissues does not exceed 0.3-0.5. Several characteristics determine the ratio between the content of mRNA and protein: among them, the rate of movement of the ribosome along the mRNA and the number of free ribosomes in the cell, the availability of tRNA, the secondary structure, and the localization of the transcript. The technical features of the experimental methods also significantly influence the levels of the transcript and protein of the corresponding gene on the outcome of the comparison. Given the above biological features and the performance of experimental and bioinformatic approaches, one may develop various models to predict proteomic profiles based on transcriptomic data. This review is devoted to the ability of RNA sequencing methods for protein abundance prediction.


Assuntos
Proteoma , Proteômica , Proteoma/genética , Proteômica/métodos , Perfilação da Expressão Gênica , Transcriptoma/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo
5.
Metabolites ; 13(8)2023 Aug 03.
Artigo em Inglês | MEDLINE | ID: mdl-37623852

RESUMO

To represent the composition of small molecules circulating in HepG2 cells and the formation of the "core" of characteristic metabolites that often attract researchers' attention, we conducted a meta-analysis of 56 datasets obtained through metabolomic profiling via mass spectrometry and NMR. We highlighted the 288 most commonly studied compounds of diverse chemical nature and analyzed metabolic processes involving these small molecules. Building a complete map of the metabolome of a cell, which encompasses the diversity of possible impacts on it, is a severe challenge for the scientific community, which is faced not only with natural limitations of experimental technologies, but also with the absence of transparent and widely accepted standards for processing and presenting the obtained metabolomic data. Formulating our research design, we aimed to reveal metabolites crucial to the Hepg2 cell line, regardless of all chemical and/or physical impact factors. Unfortunately, the existing paradigm of data policy leads to a streetlight effect. When analyzing and reporting only target metabolites of interest, the community ignores the changes in the metabolomic landscape that hide many molecular secrets.

6.
Curr Issues Mol Biol ; 45(4): 3406-3418, 2023 Apr 13.
Artigo em Inglês | MEDLINE | ID: mdl-37185747

RESUMO

Database records contain useful information, which is readily available, but, unfortunately, limited compared to the source (publications). Our study reviewed the text fragments supporting the association between the biological macromolecules and diseases from Open Targets to map them on the biological level of study (DNA/RNA, proteins, metabolites). We screened records using a dictionary containing terms related to the selected levels of study, reviewed 600 hits manually and used machine learning to classify 31,260 text fragments. Our results indicate that association studies between diseases and macromolecules conducted on the level of DNA and RNA prevail, followed by the studies on the level of proteins and metabolites. We conclude that there is a clear need to translate the knowledge from the DNA/RNA level to the evidence on the level of proteins and metabolites. Since genes and their transcripts rarely act in the cell by themselves, more direct evidence may be of greater value for basic and applied research.

7.
Int J Mol Sci ; 24(3)2023 Jan 18.
Artigo em Inglês | MEDLINE | ID: mdl-36768195

RESUMO

The beginning of the twenty-first century witnessed novel breakthrough research directions in the life sciences, such as genomics, transcriptomics, translatomics, proteomics, metabolomics, and bioinformatics. A newly developed single-molecule approach addresses the physical and chemical properties and the functional activity of single (individual) biomacromolecules and viral particles. Within the alternative approach, the combination of "single-molecule approaches" is opposed to "omics approaches". This new approach is fundamentally unique in terms of its research object (a single biomacromolecule). Most studies are currently performed using postgenomic technologies that allow the properties of several hundreds of millions or even billions of biomacromolecules to be analyzed. This paper discusses the relevance and theoretical, methodological, and practical issues related to the development potential of a single-molecule approach using methods based on molecular detectors.


Assuntos
Genômica , Vírus , Genômica/métodos , Proteômica/métodos , Biologia Computacional , Metabolômica/métodos
8.
Life (Basel) ; 13(2)2023 Jan 17.
Artigo em Inglês | MEDLINE | ID: mdl-36836616

RESUMO

Foodborne bacteria interconnect food and human health. Despite significant progress in food safety regulation, bacterial contamination is still a serious public health concern and the reason for significant commercial losses. The screening of the microbiome in meals is one of the main aspects of food production safety influencing the health of the end-consumers. Our research provides an overview of proteomics findings in the field of food safety made over the last decade. It was believed that proteomics offered an accurate snapshot of the complex networks of the major biological machines called proteins. The proteomic methods for the detection of pathogens were armed with bioinformatics algorithms, allowing us to map the data onto the genome and transcriptome. The mechanisms of the interaction between bacteria and their environment were elucidated with unprecedented sensitivity, specificity, and depth. Using our web-based tool ScanBious for automated publication analysis, we analyzed over 48,000 scientific articles on antibiotic and disinfectant resistance and highlighted the benefits of proteomics for the food safety field. The most promising approach to studying safety in food production is the combination of classical genomic and metagenomic approaches and the advantages provided by proteomic methods with the use of panoramic and targeted mass spectrometry.

9.
Int J Mol Sci ; 24(1)2023 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-36614211

RESUMO

A meta-analysis of the results of targeted quantitative screening of human blood plasma was performed to generate a reference standard kit that can be used for health analytics. The panel included 53 of the 296 proteins that form a "stable" part of the proteome of a healthy individual; these proteins were found in at least 70% of samples and were characterized by an interindividual coefficient of variation <40%. The concentration range of the selected proteins was 10−10−10−3 M and enrichment analysis revealed their association with rare familial diseases. The concentration of ceruloplasmin was reduced by approximately three orders of magnitude in patients with neurological disorders compared to healthy volunteers, and those of gelsolin isoform 1 and complement factor H were abruptly reduced in patients with lung adenocarcinoma. Absolute quantitative data of the individual proteome of a healthy and diseased individual can be used as the basis for personalized medicine and health monitoring. Storage over time allows us to identify individual biomarkers in the molecular landscape and prevent pathological conditions.


Assuntos
Proteínas Sanguíneas , Plasma , Proteoma , Humanos , Proteínas Sanguíneas/metabolismo , Ceruloplasmina/metabolismo , Espectrometria de Massas/métodos , Plasma/metabolismo , Proteômica
10.
Int J Mol Sci ; 25(1)2023 Dec 28.
Artigo em Inglês | MEDLINE | ID: mdl-38203578

RESUMO

This work demonstrates the use of a modified mica to concentrate proteins, which is required for proteomic profiling of blood plasma by mass spectrometry (MS). The surface of mica substrates, which are routinely used in atomic force microscopy (AFM), was modified with a photocrosslinker to allow "irreversible" binding of proteins via covalent bond formation. This modified substrate was called the AFM chip. This study aimed to determine the role of the surface and crosslinker in the efficient concentration of various types of proteins in plasma over a wide concentration range. The substrate surface was modified with a 4-benzoylbenzoic acid N-succinimidyl ester (SuccBB) photocrosslinker, activated by UV irradiation. AFM chips were incubated with plasma samples from a healthy volunteer at various dilution ratios (102X, 104X, and 106X). Control experiments were performed without UV irradiation to evaluate the contribution of physical protein adsorption to the concentration efficiency. AFM imaging confirmed the presence of protein layers on the chip surface after incubation with the samples. MS analysis of different samples indicated that the proteomic profile of the AFM-visualized layers contained common and unique proteins. In the working series of experiments, 228 proteins were identified on the chip surface for all samples, and 21 proteins were not identified in the control series. In the control series, a total of 220 proteins were identified on the chip surface, seven of which were not found in the working series. In plasma samples at various dilution ratios, a total of 146 proteins were identified without the concentration step, while 17 proteins were not detected in the series using AFM chips. The introduction of a concentration step using AFM chips allowed us to identify more proteins than in plasma samples without this step. We found that AFM chips with a modified surface facilitate the efficient concentration of proteins owing to the adsorption factor and the formation of covalent bonds between the proteins and the chip surface. The results of our study can be applied in the development of highly sensitive analytical systems for determining the complete composition of the plasma proteome.


Assuntos
Proteínas Sanguíneas , Proteômica , Humanos , Silicatos de Alumínio , Espectrometria de Massas
11.
Front Mol Biosci ; 9: 944639, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36545510

RESUMO

It has been shown that the best coverage of the HepG2 cell line transcriptome encoded by genes of a single chromosome, chromosome 18, is achieved by a combination of two sequencing platforms, Illumina RNA-Seq and Oxford Nanopore Technologies (ONT), using cut-off levels of FPKM > 0 and TPM > 0, respectively. In this study, we investigated the extent to which the combination of these transcriptomic analysis methods makes it possible to achieve a high coverage of the transcriptome encoded by the genes of other human chromosomes. A comparative analysis of transcriptome coverage for various types of biological material was carried out, and the HepG2 cell line transcriptome was compared with the transcriptome of liver tissue cells. In addition, the contribution of variability in the coverage of expressed genes in human transcriptomes to the creation of a draft human transcriptome was evaluated. For human liver tissues, ONT makes an extremely insignificant contribution to the overall coverage of the transcriptome. Thus, to ensure maximum coverage of the liver tissue transcriptome, it is sufficient to apply only one technology: Illumina RNA-Seq (FPKM > 0).

12.
Cells ; 11(22)2022 11 10.
Artigo em Inglês | MEDLINE | ID: mdl-36428976

RESUMO

Both biological and technical variations can discredit the reliability of obtained data in omics studies. In this technical note, we investigated the effect of prolonged cultivation of the HepG2 hepatoma cell line on its metabolomic profile. Using the GC × GC-MS approach, we determined the degree of metabolic variability across HepG2 cells cultured in uniform conditions for 0, 5, 10, 15, and 20 days. Post-processing of obtained data revealed substantial changes in relative abundances of 110 metabolites among HepG2 samples under investigation. Our findings have implications for interpreting metabolomic results obtained from immortal cells, especially in longitudinal studies. There are still plenty of unanswered questions regarding metabolomics variability and many potential areas for future targeted and panoramic research. However, we suggest that the metabolome of cell lines is unstable and may undergo significant transformation over time, even if the culture conditions remain the same. Considering metabolomics variability on a relatively long-term basis, careful experimentation with particular attention to control samples is required to ensure reproducibility and relevance of the research results when testing both fundamentally and practically significant hypotheses.


Assuntos
Metaboloma , Metabolômica , Humanos , Reprodutibilidade dos Testes , Células Hep G2 , Metabolômica/métodos , Cromatografia Gasosa-Espectrometria de Massas/métodos
13.
Curr Protein Pept Sci ; 23(4): 290-298, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35619260

RESUMO

AIMS: The main goal of the Russian part of C-HPP is to detect and functionally annotate missing proteins (PE2-PE4) encoded by human chromosome 18. To achieve this goal, it is necessary to use the most sensitive methods of analysis. BACKGROUND: However, identifying such proteins in a complex biological mixture using mass spectrometry (MS)-based methods is difficult due to the insufficient sensitivity of proteomic analysis methods. A possible solution to the problem is the pre-fractionation of a complex biological sample at the sample preparation stage. OBJECTIVE: This study aims to measure the detection limit of SRM SIS analysis using a standard set of UPS1 proteins and find a way to enhance the sensitivity of the analysis and to, detect proteins encoded by the human chromosome 18 in liver tissue samples, and compare the data with transcriptomic analysis of the same samples. METHODS: Mass spectrometry, data-dependent acquisition, selected reaction monitoring, highperformance liquid chromatography, data-dependent acquisition in combination with pre-fractionation by alkaline reversed-phase chromatography, selected reaction monitoring in combination with prefractionation by alkaline reversed-phase chromatography methods were used in this study. RESULTS: The results revealed that 100% of UPS1 proteins in a mixture could only be identified at a concentration of at least 10-9 М. The decrease in concentration leads to protein losses associated with technology sensitivity, and no UPS1 protein is detected at a concentration of 10-13 М. Therefore, the two-dimensional fractionation of samples was applied to improve sensitivity. The human liver tissue was examined by selected reaction monitoring and shotgun methods of MS analysis using onedimensional and two-dimensional fractionation to identify the proteins encoded by human chromosome 18. A total of 134 proteins were identified. The overlap between proteomic and transcriptomic data in human liver tissue was ~50%. CONCLUSION: The sample concentration technique is well suited for a standard UPS1 system that is not contaminated with a complex biological sample. However, it is not suitable for use with a complex biological protein mixture. Thus, it is necessary to develop more sophisticated fractionation systems for the detection of all low-copy proteins. This weak convergence is due to the low sensitivity of proteomic technology compared to transcriptomic approaches. Also, total mRNA was used to perform RNA-seq analysis, but not all detected mRNA molecules could be translated into proteins. This introduces additional uncertainty in the data; in the future, we plan to study only translated mRNA molecules-the translatome. Data is available via ProteomeXchange with identifier PXD026997.


Assuntos
Proteínas , Proteômica , Humanos , Fígado/metabolismo , Proteínas/metabolismo , Proteoma/metabolismo , Proteômica/métodos , RNA Mensageiro/análise , RNA Mensageiro/metabolismo , Tecnologia
14.
Molecules ; 27(8)2022 Apr 12.
Artigo em Inglês | MEDLINE | ID: mdl-35458675

RESUMO

We sought to identify the characteristic metabolite profile of blood plasma samples obtained from patients with preeclampsia. Direct high-resolution mass spectrometry was used to analyze samples from 79 pregnant women, 34 of whom had preeclampsia. We performed a comparative analysis of the metabolite profiles and found that they differed between pregnant women with and without preeclampsia. Lipids and sugars were identified as components of the metabolite profile that are likely to be associated with the development of preeclampsia. While PE was established only in the third trimester, a set of metabolites specific for the third trimester, including 2-(acetylamino)-1,5-anhydro-2-deoxy-4-O-b-D-galactopyranosyl-D-arabino-Hex-1-enitol, N-Acetyl-D-glucosaminyldiphosphodolichol, Cer(d18:0/20:0), and allolithocholic acid, was already traced in the first trimester. These components are also likely involved in lipid metabolism disorders and the development of oxidative stress.


Assuntos
Pré-Eclâmpsia , Biomarcadores , Feminino , Humanos , Metabolômica/métodos , Pré-Eclâmpsia/diagnóstico , Gravidez , Primeiro Trimestre da Gravidez , Estudos Retrospectivos
15.
J Pers Med ; 12(3)2022 Mar 16.
Artigo em Inglês | MEDLINE | ID: mdl-35330478

RESUMO

Within the Human Proteome Project initiative framework for creating functional annotations of uPE1 proteins, the neXt-CP50 Challenge was launched in 2018. In analogy with the missing-protein challenge, each command deciphers the functional features of the proteins in the chromosome-centric mode. However, the neXt-CP50 Challenge is more complicated than the missing-protein challenge: the approaches and methods for solving the problem are clear, but neither the concept of protein function nor specific experimental and/or bioinformatics protocols have been standardized to address it. We proposed using a retrospective analysis of the key HPP repository, the neXtProt database, to identify the most frequently used experimental and bioinformatic methods for analyzing protein functions, and the dynamics of accumulation of functional annotations. It has been shown that the dynamics of the increase in the number of proteins with known functions are greater than the progress made in the experimental confirmation of the existence of questionable proteins in the framework of the missing-protein challenge. At the same time, the functional annotation is based on the guilty-by-association postulate, according to which, based on large-scale experiments on API-MS and Y2H, proteins with unknown functions are most likely mapped through "handshakes" to biochemical processes.

16.
Data Brief ; 42: 108055, 2022 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-35345844

RESUMO

The data was acquired from 3 normal human liver tissues by LC-MS methods. The tissue liver samples from male subjects post mortem were obtained from ILSBio LLC (https://bioivt.com/). Liver tissue was frozen in liquid nitrogen, transported and shipped on dry ice. The proteins were extracted and purified followed up by trypsin hydrolysis. The peptide mixture was aliquoted and analyzed by different LC-MS approaches: one-dimensional shotgun LC-MS, two-dimensional LC-MS, two-dimensional SRM SIS (Selected Reaction Monitoring with Stable Isotope-labeled peptide Standards). The Shotgun assay resulted in a qualitative in-depth human liver proteome, and a semi-quantitative iBAQ (intensity-based absolute quantification) value was calculated to show the relative protein content of the sample. Absolute quantitative concentrations of proteins encoded by human chromosome 18 using SRM SIS were obtained.

17.
Scientometrics ; 127(4): 1953-1967, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35221395

RESUMO

The paper describes a scheme for the comparative analysis of the sets of Pubmed publications. The proposed analysis is based on the comparison of the frequencies of occurrence of keywords-MeSH terms. The purpose of the analysis is to identify MeSH terms that characterize research areas specific to each group of articles, as well as to identify trends-topics on which the number of published works has changed significantly in recent years. The proposed approach was tested by comparing a set of medical publications and a group of articles in the field of personalized medicine. We analyzed about 700 thousand abstracts published in the period 2009-2021 and indexed them with MeSH terms. Topics with increasing research interest have been identified both in the field of medicine in general and specific to personalized medicine. Retrospective analysis of the keywords frequency of occurrence changes has shown the shift of the scientific priorities in this area over the past 10 years. The revealed patterns can be used to predict the relevance and significance of the scientific work direction in the horizon of 3-5 years. The proposed analysis can be scaled in the future for a larger number of groups of publications, as well as adjusted by introducing filters at the stage of sampling (scientific centers, journals, availability of full texts, etc.) or selecting a list of keywords (frequency threshold, use of qualifiers, category of generalizations). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11192-022-04292-y.

18.
Front Genet ; 12: 674534, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34194472

RESUMO

The cutoff level applied in sequencing analysis varies according to the sequencing technology, sample type, and study purpose, which can largely affect the coverage and reliability of the data obtained. In this study, we aimed to determine the optimal combination of parameters for reliable RNA transcriptome data analysis. Toward this end, we compared the results obtained from different transcriptome analysis platforms (quantitative polymerase chain reaction, Illumina RNASeq, and Oxford Nanopore Technologies MinION) for the transcriptome encoded by human chromosome 18 (Chr 18) using the same sample types (HepG2 cells and liver tissue). A total of 275 protein-coding genes encoded by Chr 18 was taken as the gene set for evaluation. The combination of Illumina RNASeq and MinION nanopore technologies enabled the detection of at least one transcript for each protein-coding gene encoded by Chr 18. This combination also reduced the probability of false-positive detection of low-copy transcripts due to the simultaneous confirmation of the presence of a transcript by the two fundamentally different technologies: short reads essential for reliable detection (Illumina RNASeq) and long-read sequencing data (MinION). The combination of these technologies achieved complete coverage of all 275 protein-coding genes on Chr 18, identifying transcripts with non-zero expression levels. This approach can improve distinguishing the biological and technical reasons for the absence of mRNA detection for a given gene in transcriptomics.

19.
Data Brief ; 36: 107130, 2021 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-34095379

RESUMO

The chromosome-centric dataset was created by applying several technologies of transcriptome profiling. The described dataset is available at NCBI repository (BioProject ID PRJNA635536). The dataset referred to the same type of tissue, cell lines, transcriptome sequencing technologies, and was accomplished in a period of 8 years (the first data were obtained in 2013 while the last ones - in 2020). The high-throughput sequencing technologies were employed along with the quantitative PCR (qPCR) approach, for data generation using the gene expression level assessment. qPCR was performed for a limited group of genes, encoded on human chromosome 18, for the Russian part of the Chromosome-Centric Human Proteome Project. The data of high-throughput sequencing are provided as Excel spreadsheets, where the data on FPKM and TMP values were evaluated for the whole transcriptome with both Illumina HiSeq and Oxford Nanopore Technologies MinION sequencing.

20.
J Pers Med ; 11(4)2021 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-33805313

RESUMO

We used automatic text-mining of PubMed abstracts of papers related to obesity, with the aim of revealing that the information used in abstracts reflects the current understanding and key concepts of this widely explored problem. We compared expert data from DisGeNET to the results of an automated MeSH (Medical Subject Heading) search, which was performed by the ScanBious web tool. The analysis provided an overview of the obesity field, highlighting major trends such as physiological conditions, age, and diet, as well as key well-studied genes, such as adiponectin and its receptor. By intersecting the DisGeNET knowledge with the ScanBious results, we deciphered four clusters of obesity-related genes. An initial set of 100+ thousand abstracts and 622 genes was reduced to 19 genes, distributed among just a few groups: heredity, inflammation, intercellular signaling, and cancer. Rapid profiling of articles could drive personalized medicine: if the disease signs of a particular person were superimposed on a general network, then it would be possible to understand which are non-specific (observed in cohorts and, therefore, most likely have known treatment solutions) and which are less investigated, and probably represent a personalized case.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...